Skip to content

Revert "[PERF] Decouple projections from GDN custom op"#28080

Merged
simon-mo merged 1 commit intovllm-project:mainfrom
CentML:revert-27512-vadim/refac-gdn
Nov 4, 2025
Merged

Revert "[PERF] Decouple projections from GDN custom op"#28080
simon-mo merged 1 commit intovllm-project:mainfrom
CentML:revert-27512-vadim/refac-gdn

Conversation

@vadiklyutiy
Copy link
Collaborator

Reverts #27512

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request reverts a previous change that decoupled input/output projections from the GDN custom operation. The changes are consistent across all files, effectively re-integrating the projection logic into the gdn_attention custom op's execution path. The modifications, including renaming the custom op from gdn_attention_core back to gdn_attention and adjusting the related function signatures and logic flow, appear to be a correct and complete revert of the original performance optimization attempt. I have reviewed the changes and found no new issues of high or critical severity introduced by this revert.

…7512)

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
@vadiklyutiy vadiklyutiy force-pushed the revert-27512-vadim/refac-gdn branch from 3d3da84 to fb128a1 Compare November 4, 2025 23:57
@simon-mo simon-mo merged commit d4e547b into vllm-project:main Nov 4, 2025
4 of 5 checks passed
@vadiklyutiy vadiklyutiy deleted the revert-27512-vadim/refac-gdn branch November 5, 2025 00:01
ZhengHongming888 pushed a commit to ZhengHongming888/vllm that referenced this pull request Nov 8, 2025
…#28080)

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025
…#28080)

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

qwen Related to Qwen models

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants